Inferring Human Attention by Learning Latent Intentions
نویسندگان
چکیده
This paper addresses the problem of inferring 3D human attention in RGB-D videos at scene scale. 3D human attention describes where a human is looking in 3D scenes. We propose a probabilistic method to jointly model attention, intentions, and their interactions. Latent intentions guide human attention which conversely reveals the intention features. This mutual interaction makes attention inference a joint optimization with latent intentions. An EM-based approach is adopted to learn the latent intentions and model parameters. Given an RGB-D video with 3D human skeletons, a jointstate dynamic programming algorithm is utilized to jointly infer the latent intentions, the 3D attention directions, and the attention voxels in scene point clouds. Experiments on a new 3D human attention dataset prove the strength of our method.
منابع مشابه
Where and Why Are They Looking? Jointly Inferring Human Attention and Intentions in Complex Tasks
This paper addresses a new problem jointly inferring human attention, intentions, and tasks from videos. Given an RGB-D video where a human performs a task, we answer three questions simultaneously: 1) where the human is looking attention prediction; 2) why the human is looking there intention prediction; and 3) what task the human is performing task recognition. We propose a hierarchical model...
متن کاملCognitive Interactive Robot Learning
Building general purpose autonomous robots that suit a wide range of user-specified applications, requires a leap from today’s task-specific machines to more flexible and general ones. To achieve this goal, one should move from traditional preprogrammed robots to learning robots that easily can acquire new skills. Learning from Demonstration (LfD) and Imitation Learning (IL), in which the robot...
متن کاملLearning Mental States from Biosignals
Aalto University, P.O. Box 11000, FI-00076 Aalto www.aalto.fi Author Melih Kandemir Name of the doctoral dissertation Learning Mental States from Biosignals Publisher School of Science Unit Department of Information and Computer Science Series Aalto University publication series DOCTORAL DISSERTATIONS 61/2013 Field of research Computer and Information Science Manuscript submitted 20 November 20...
متن کاملSocial cognition and the brain: a meta-analysis.
This meta-analysis explores the location and function of brain areas involved in social cognition, or the capacity to understand people's behavioral intentions, social beliefs, and personality traits. On the basis of over 200 fMRI studies, it tests alternative theoretical proposals that attempt to explain how several brain areas process information relevant for social cognition. The results sug...
متن کاملLatent Intention Dialogue Models
Developing a dialogue agent that is capable of making autonomous decisions and communicating by natural language is one of the long-term goals of machine learning research. Traditional approaches either rely on hand-crafting a small state-action set for applying reinforcement learning that is not scalable or constructing deterministic models for learning dialogue sentences that fail to capture ...
متن کامل